Mining Unusual Patterns by Multi-Dimensional Analysis of Data Streams
نویسنده
چکیده
It has been popularly recognized that stream data represents an important form of data, with broad applications. There have been a lot of studies on effective stream data management and query processing, as well as some recent studies on stream data mining. Although this is a promising direction, most existing studies have not paid enough attention to one critical fact: most data streams reside at a rather low level of abstraction and are multi-dimensional in nature, whereas most analysts are interested in finding characteristic features, unusual patterns, and dynamic changes (such as trends and outliers) at relatively high levels of abstraction and in certain multi-dimensional space. To accomplish such tasks, one may need to develop effective mechanisms for on-line, multi-dimensional analysis and mining of stream data. This poses great challenges on system architecture, implementation methodology, algorithm development, and performance tuning. In this paper, we discuss the issues related to effective, on-line, multi-dimensional analysis and mining of unusual events and patterns in data streams, including research challenges, potential architectures, and implementation methodologies.
منابع مشابه
Mining Frequent Patterns in Uncertain and Relational Data Streams using the Landmark Windows
Todays, in many modern applications, we search for frequent and repeating patterns in the analyzed data sets. In this search, we look for patterns that frequently appear in data set and mark them as frequent patterns to enable users to make decisions based on these discoveries. Most algorithms presented in the context of data stream mining and frequent pattern detection, work either on uncertai...
متن کاملProposing an approach to calculate headway intervals to improve bus fleet scheduling using a data mining algorithm
The growth of AVL (Automatic Vehicle Location) systems leads to huge amount of data about different parts of bus fleet (buses, stations, passenger, etc.) which is very useful to improve bus fleet efficiency. In addition, by processing fleet and passengers’ historical data it is possible to detect passenger’s behavioral patterns in different parts of the day and to use it in order to improve fle...
متن کاملMining multi-dimensional concept-drifting data streams using Bayesian network classifiers
In recent years, a plethora of approaches have been proposed to deal with the increasingly challenging task of mining concept-drifting data streams. However, most of these approaches can only be applied to uni-dimensional classification problems where each input instance has to be assigned to a single output class variable. The problem of mining multi-dimensional data streams, which includes mu...
متن کاملA Sliding Window Algorithm for Relational Frequent Patterns Mining from Data Streams
Some challenges in frequent pattern mining from data streams are the drift of data distribution and the computational efficiency. In this work an additional challenge is considered: data streams describe complex objects modeled by multiple database relations. A multi-relational data mining algorithm is proposed to efficiently discover approximate relational frequent patterns over a sliding time...
متن کاملMulti-Dimensional Analysis of Data Streams Using Stream Cubes
Large volumes of dynamic stream data pose great challenges to its analysis. Besides its dynamic and transient behavior, stream data has another important characteristic: multi-dimensionality. Much of stream data resides at a multidimensional space and at rather low level of abstraction, whereas most analysts are interested in relatively high-level dynamic changes in some combination of dimensio...
متن کامل